KMID : 0917520020090010207
|
|
Journal of Speech Sciences 2002 Volume.9 No. 1 p.207 ~ p.215
|
|
Perceptual Evaluation of Duration Models in Spoken Korean
|
|
Chung Hyun-Song
|
|
Abstract
|
|
|
Perceptual evaluation of duration models of spoken Korean was carried out bvased on the Classification and Regression Tree (CART) model for text-to-speech conversion. A reference set of durations was produced by a commercial text-to-speech synthesis system for comparison. The duration model which was built in the previous research (Chung & Huckvale, 2001) was applied to a Korean language speech synthesis diphone database, "Hanmal (HN 1.0)". Tfhe synthetic speech produced by the CART duration model was preferred in the subjective preference test by a small margin and the synthetic speech from the commercial system was superior in the clarity test. In the course of preparing the experiment, a labeled database of spoken Korean with 670 sentences was constructed. As a result of the experiment, a trained duration model for speech synthesis was obtained. Tfhe "Hanmal" diphone database for Korean speech synthesis was also developed as a by-product of the perceptual evaluation.
|
|
KEYWORD
|
|
|
|
FullTexts / Linksout information
|
|
|
|
Listed journal information
|
|
|